Distributional reinforcement learning with unconstrained monotonic neural networks

نویسندگان

چکیده

The distributional reinforcement learning (RL) approach advocates for representing the complete probability distribution of random return instead only modelling its expectation. A RL algorithm may be characterised by two main components, namely representation together with parameterisation and metric defining loss. present research work considers unconstrained monotonic neural network (UMNN) architecture, a universal approximator continuous functions which is particularly well suited different representations distribution. This property enables efficient decoupling effect function class from that metric. paper firstly introduces methodology (PDF, CDF QF). Secondly, novel named deep Q-network (UMDQN) presented. To authors' knowledge, it first method supporting three, valid Lastly, in light this new algorithm, an empirical comparison performed between three quasi-metrics, Kullback-Leibler divergence, Cramer distance, Wasserstein distance. results highlight strengths weaknesses associated each important limitation

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement Learning in Neural Networks: A Survey

In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...

متن کامل

Reinforcement Learning in Neural Networks: A Survey

متن کامل

reinforcement learning in neural networks: a survey

in recent years, researches on reinforcement learning (rl) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. neural network reinforcement learning (nnrl) is among the most popular algorithms in the rl framework. the advantage of using neural networks enables the rl to search for optimal policies more efficiently in several real-life applicat...

متن کامل

Planning with neural networks and reinforcement learning

planning with neural networks, time limits of discounted reinforcement learning Planning, taskability, Dyna-PI architectures Dyna-PI architectures: focussing, forward and backward planning, acting and (re)planning. Tested with... Ideas from problem solving and

متن کامل

Stable reinforcement learning with recurrent neural networks

In this paper, we present a technique for ensuring the stability of a large class of adaptively controlled systems. We combine IQC models of both the controlled system and the controller with a method of filtering control parameter updates to ensure stable behavior of the controlled system under adaptation of the controller. We present a specific application to a system that uses recurrent neur...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neurocomputing

سال: 2023

ISSN: ['0925-2312', '1872-8286']

DOI: https://doi.org/10.1016/j.neucom.2023.02.049